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Abstract 

Pathwise predictability of continuous time processes is studied in deterministic 
setting. We discuss uniform prediction in some weak sense with respect to certain 
classes of inputs. More precisely, we study possibility of approximation of convolution 
integrals over future time by integrals over past time. We found that all band-limited 
processes are predictable in this sense, as well as high-frequency processes with zero 
energy at low frequencies. It follows that a process of mixed type still can be predicted 
if an ideal low-pass filter exists for this process. 
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1 Introduction 

We study pathwise predictability of continuous time processes in deterministic setting. 
It is well known that certain restrictions on frequency distribution can ensure additional 
opportunities for prediction and interpolation of the processes. The classical result is 
Nyquist-Shannon-Kotelnikov interpolation theorem for the low-band processes. There are 
related predictability results for low-band processes (see, e.g., Wainstein and Zubakov 
(1962), Beutler (1966), Brown(1969), Slepian (1978), Knab (1981), Papoulis (1985), Mar- 
vasti (1986), Vaidyanathan (1987), Lyman et al (2000, 2001)). 

In the present paper, we study a special kind of weak predictability such that convo- 
lution integrals over future can be approximated by convolution integrals over past times 
representing historical observations. We found some cases when this approximation can 
be made uniformly over a wide class of input processes. We found that all band-limited 



processes are predictable in this sense. Similar result is obtained for high-frequency pro- 
cesses. For the processes of mixed type, we found that the similar predictability can be 
achieved when the model allows a low pass filter that acts as an ideal low-pass filter for 
this process. These results can be a useful addition to the existing theory of band-limited 
processes. The novelty is that we consider predictability of both high frequent and band- 
limited processes in a weak sense uniformly over classes of input processes. In addition, 
we suggest a new type of predictor. Its kernel is given explicitly in the frequency domain. 



2 Definitions 

Let I denote the indicator function, R + = [0, +00), C + = {z G C : Rez > 0}, i = 

For complex valued functions x G Li(R) one I>2(R), we denote by X = Tx the 
function defined on iR as the Fourier transform of x: 



/oo 
e- iujt x(t)dt, w G R. 
-00 



If x G L2(R), then X is defined as an element of L2(R). 

For v(-) G L,2(R) such that v (t) = for t < 0, we denote by Cv the Laplace transform 



m = I 

Jo 



00 

pt 



V(p) = (Cv)(p) = / e' pt v{t)dt, p G C+ (2.1) 



Let H r be the Hardy space of holomorphic on C + functions h(p) with finite norm 
\\h\\ H r = sup s>0 \\h(s + iu;)\\ Lr{R) , r G [1, +00] (see, e.g., Duren (1970)). 
Let Q > be given. 

Definition 1 Let K. be the class of functions k : R — > R such that k(t) = for t > and 
such that K = Tk is 

«™ - d M <2 - 2) 

where d(-) and £(•) are polynomials such that degd < deg5, and if S(p) = for p G C 
then Rep > 0, |Imp| < $7. 

Note that the class K, is quite wide: it consists of linear combinations of functions 
g(t)e A *I{ t<0 }, where A G C, Re A > 0, |ImA| < $7, and where q(t) is a polynomial. 

Definition 2 Let K. be the class of functions k : R — > R such that k(t) = for t < and 
such that K(-) = Ck G H 2 n 



We are going to study linear predictors in the form y(t) = J_ (x> k(t — s)x(s)ds for the 
processes y{t) = k(t — s)x(s)ds, where k G K and k £ K. The predictors use 

historical values of currently observable process x(-). 



2 



Definition 3 Corrections: 

On Endnote 1: it should be "Let V be as defined above" 

On Endnote 2: it should be "are such as required in statements (ii)-(iii)". 

On Endnote 3: it should be "Let V and K be as defined above" 

Additional correction: P. 6, Line 25: please put "oj G D e " instead of "uj G D". 

P. 7, Line 9: "Academic Press" instead of "Academic" 

P. 7, Line 18: "Fourier analysis, and uncertainty" instead of "Fourier analysis and 
uncertainty" 

Let X = be a class of functions x : R — * C. Let r G [1, +oo]. 

(i) We say that the class X is L r -predictable in the weak sense if, for any k(-) G /C, 
there exists a sequence {k m (-)}^^ 1 = {k m (-, X, k)}^^ 1 C /C such that 

\\y- 2/m|U r (R) -> as m^+oo Mx G X, 

where 

r+oo ft ^ 

y(t) = k(t- s)x(s)ds, y m {t) = / k m (t - s)x(s)ds. 

J t J — oo 

(ii) Let the set J-(X) = {X = Tx, x G X} be provided with a norm \\ • ||. We say that 
the class X is L r -predictable in the weak sense uniformly with respect to the norm 
|| • ||, if, for any k(-) G K, and e > 0, there exists k(-) = k(-,X,k, \\ ■ G K, such 
that 

h - y IU r (R) < £ ll^ll Vx £ x, x = Tx. 

Here y{-) is the same as above, y(t) = f_ k(t — s)x(s)ds. 
We call functions k(-) in Definition [3] predictors or predicting kernels. 

3 The main result 

Let £1 > be the same as in the definition of JC, and let 

X L = {x(-) G L a (R) : X(lo) =0 if \u\ > Q, X = Fx}, 
X H = {x(-) G L 2 (R) : X(uj) = if \u\ < O, X = Fx}. 

In particular, Xl is a class of band-limited processes, and Xjj is a class of high-frequency 
processes. 
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3.1 Predictability of band-limited and high-frequency processes from L 2 
Theorem 1 (i) The classes Xl and Xh are L,2-predictable in the weak sense, 
(ii) The classes Xl and Xjj are Lao-predictable in the weak sense uniformly with respect 



(Hi) For any q > 2, the classes Xl and Xh are L2 -predictable in the weak sense uniformly 
with respect to the norm \\ ■ \\L q (R)- 

Remark 1 Since the constant is the same for the classes K,, Xl, Xh, the set ofk(-) £ K, 
such that the corresponding processes y(-) can be predicted is restricted for x(-) £ Xh- On 
the other hand, these restrictions are absent for band-limited processes x(-) £ Xl, since 
they are automatically included to all similar classes with larger £1, i.e., the constant Vt in 
the definition of Xl can always be increased. 

The question arises how to find the predicting kernels. In the proof of Theorem [H a 
possible choice of the kernels is given explicitly via Fourier transforms. 

3.2 Predictability for some bounded processes 

Let C(R) be the Banach space of all bounded and continuous functions / : R — > C, and let 
C(R)* be the dual space for C(R), i.e., it is the space of all linear continuous functionals 
£ : C(R) -> C (see, e.g., Yosida (1980)). 

Let Moo be the class of all processes x{t) : R — > C such that there exists a function 
X c G Li(R), a sequence C R, and a sequence {cfe}^^ C C such that Ylt=i \ c k\ < 

+00 and 



Clearly, any set X = ({wfc}^^, {ck}^^x,X c ) with the required properties is uniquely 
defined by the process x £ .Moo, and can be associated with an unique element of C(R)* 
such that 



In particular, x(t) = (■n=e it ',X) for all t. We will denote this relationship as X = Tx, 
using the same notation as for the Fourier transform, and we extend Definition on this 
case (it is a frequency representation, but not a Fourier transform anymore). As required 
in Definition [3l we provide the set {X} of these sets X with the norm || • ||c7(r)*- 



to the norm 



L 2 (R)- 





k=l 



4 



If x G .Moo, then \x(t)\ < (27r) _1 ||e l *'||c( R )||X||c( R )*. Hence all functions from Moo 
are bounded on R. 

Let e G (0, Q) be given. Let 

Ml = {x G Moo ■ |wfc| < - e (Vfc), suppX c C [— O + e, - e]|, 

■Mi/ = |z € >ioo : |o7fc| > + e (Vfc), suppX c C (-00, -17 - e] U [O + e, +00)}. 

Ml is a class of band-limited processes, and Mh is a class of high-frequency processes. 

Theorem 2 T/ie classes Ml and Mh are Loo-predictable in the weak sense uniformly 
with respect to the norm \\ ■ \\ccr)*- 

4 On a model with ideal low pass-pass filter 

Corollary 1 Assume a model with a process x(-) such that an observer is able to decom- 
pose it as x(t) = XL^t) + XH{t), where xl(-) G Xl U Ml and x//(-) G Xjj U Mh- Then 
this observer would be able to predict (approximately, in the sense of weak presdictability) 
the values of y{t) = / t + °° k(t — s)x(s)ds for k(-) G /C by predicting the processes j/z(i) = 
J^~°° k(t — s)xL(s)ds and yn{t) = k(t — s)xn(s)ds separately. More precisely, the 
process y(t) = yi(t) + yn{t) is the prediction ofy(t), where y/,(t) = J** — s)xL(s)ds 

and y//(t) = /_ . kji(t — s)xH(s)ds, and where fc/,(-) and &//(•) are predicting kernels which 
existence for the processes xl(-) and £//(•) is established above. 

Let X2,M = !{|a;|<n} and X//H = 1 - XiM = I{|w|>J2}> wnere w G R. 

The assumptions of Corollary [1] mean that there are a low-pass filter and a high-pass 
filter with the transfer functions \L and \H respectively, with x(-) as the input, i.e., that 
the values xl(s) and xh(s) for s < t are available at time t, where 

x L (-)=T- l X L , X l {uj) = xl{w)X{u), i fl (-) = rt Fl ^rxffW), 



and where X = J^x. It follows that the predictability in the weak sense described in 
Definition [3] is possible for any process x(-) that can be decomposed without error on a 
band limited process and a high-frequency process, i.e., when there is a low-pass filters 
which behaves as an ideal filter for this process. (Since x//(i) = x(t) — x/,(i), existence of 
the law pass filter implies existence of the high pass filter). On the other hand, Corollary 
Q] implies that the existence of ideal low-pass filters is impossible for general processes, 
since they cannot be predictable in the sense of Definition [3l 
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Clearly, processes x(-) G Xi U Xjj U .Ml U M.h are automatically covered by Corollary 
[H i.e., the existence of the filters is not required for this case. For instance, we have 
immediately that xl(-) = x(-) and xh(-) = for band- limited processes. 

5 Proofs 

Let fc(-) G K and = JTs. Let J22J) holds with £(p) = J]™ = i $m(p), wh ere J m (p) = 

p — a m + 6 m i, and where a m ,b m G R, p G C. By the assumptions on /C, we have that 
a m > and \b m \ < Q. 

It suffices to present a set of predicting kernels k with desired properties. We will 
use a version of the construction introduced in Dokuchaev (1996) for an optimal control 
problem. This construction is very straightforward and does not use the advanced theory 
of /P-spaces. 

For 7 G R, set 

a m = ^™, V m (p)±l-exp( 1 p - am + b h mi ) , V( P )±flv m (p), 
a m V P + a m - b m ij . 

v ' m=l 

K{iu) = V{iuj)K(iLo). 



Lemma 1 (%) V(p) £ H 2 D H°° and K(p) = K(p)V{p) G H 2 n 

(jj) If j > and u> G [— f2, 0], £/ien |V(zu;)| < 1. If "f < 0, and i/w £ R, |cj| > Q, then 
\V(iu)\ < 1. 

(mj if a; G (— Q, f2), i/ien V(zu;) — > 1 as 7 — > +00. /fa; G R and |oj| > £1, then V(iu) — > 1 
as 7 — > — 00. 

(wj For any e > 0, V(iu;) — ► 1 as 7 — > +00 uniformly inu; G [— f2 + e, fi — e] as 7 — > +00, 
and V(ia;) — > 1 as 7 — > —00 uniformly in u G R sitca i/iai |w| > + e. 

Proo/ 0/ Lemma [7J Clearly, V^(p) G and 5 m (p) _1 Kn(p) G i/ 2 D i/°°, since the 

pole of <5 m (p) _1 is being compensated by multiplying on V^(p). It follows that K(p)V(p) G 
# 2 n H°°. Then statement (i) follows. 

Further, for 00 G R, 

ioj - a m + b m i _ (— Orn + w + ib m ){a m - iuj + b m i) 
iu + a m - b m i (u) - b m ) 2 + a 2 ,, 

-a m a m + (a; + b m )(uj - 

bm) ■ ~ ' a m(yJ + b m ) + 0£ m (u> + 6 m ) 

(a; - 6 m ) 2 + a 2 , (a; - 6 m ) 2 + a 2 m 
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Then 

iu - a m + b m i _ -a m am + ^ 2 - b 2 m __ to 2 - O, 2 

iio + a m - b m i (u - b m ) 2 + a 2 m (to - b m ) 2 + a 2 m ' 

Then statements (ii)-(iv) follow. This completes the proof of Lemma [TJ □ 

Proof of Theorem^ For x(-) G L 2 (R), let X = jFx, Y = Ty = K(iuj)X(uj). Let V be 
as defined above. Set Y(uj) = K(iu)X{uS) = V(iuj)Y(uj). 

Let us consider the cases of Xl and Xh simultaneously. For the case of the class Xl, 
consider 7 > and assume that 7 > and 7 — ► +00. Set D = [— Q,Q] for this case. 
For the case of the class Xh, consider 7 < and assume that 7 < and 7 — » — 00. Set 
D = (—00, —Q,] U [f2, +00) for this case. 

Let x(-) G Xl or x(-) G Xjj- In both cases, Lemma [1] gives that |V(wo)| < 1 for all 
to £ D. If 7 — > +00 or 7 — > —00 respectively for Afj, or Xh cases, then V(iu) — > 1 for a.e. 
uj £ D, i.e., for a.e. w such that X(cj) 7^ 0. 

Let us prove (i). Since K(ioS) G Loo(R) and X G L 2 (R), we have that Y(u;) = 
if(w)X(u;) G L 2 (R) and Y G L 2 (R). By Lemma[H it follows that 

Y(u) -» for a.e. w G R, (5.1) 

as 7 — > +00 or 7 — ► —00 respectively for ^ or ^fjy cases. We have that X G L 2 (R), 
if (iu) G L 2 (R) n Loo(R) and 

\K(iu) - K(iu)\ <\V(iu)-l\\K(iu)\ <2\K(iu)\, uj g D, (5.2) 
|Y(w) < 2\Y(u)\ = 2\K(iu)\\X(u)\, uj G D. (5.3) 

By (|5.ip . (|5.3p . and by Lebesque Dominance Theorem, it follows that 

\\Y-Y\\ L2(R) -h-0, i.e., ||y-y|U 2 (R) ^0 (5.4) 

as 7— > +00 or 7 — > — 00 respectively for A'/, or Xh cases, where y = J 7 ~ 1 Y. 

Let us prove (ii)-(iii). Take d = 1 for (ii) and take d = 2 for (iii). If X G L g (R) for 
q > d, then Holder inequality gives 

r-*1k(R) < \\K(iu)-K(iu)\\ L ^ D) \\X\\ Lq(D) , (5.5) 

where /U is such that 1/fi + l/</ = By (|5.2p and by Lebesque Dominance Theorem 
again, it follows that 

\\K(iu;)-K(iu)\\ LfiiD) ^0 Vm€[1,+oo) (5.6) 
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as 7 — > +00 or 7 — > —00 respectively for Xl or Xfj cases. By (|5.5|) - (|5.6p . it follows that the 
predicting kernels k(-) = k(-,-y) = J r ^ 1 K(iuj) are such as required in statements (ii)-(iii). 
This completes the proof of Theorem [TJ □ 

Proof of Theorem^ For x(-) G such that X = {{^k}t=i,{ c k}t=i^ x c), we have 
that the corresponding set Y = Ty is Y = ({^k} k =iJ {K (iu^)^}^^, K(iuj)X c (uj)^ . Simi- 
larly to X, it can be considered as an element of C(R)* such that y(t) = {±e u \Y). Let 
V and K be as defined above. Set 

Y = ({u k }t™AK{iu k )c k }l™S(iu)X c {u)) . 

It can be seen as an element of C(R)*, and y(t) = k(t — s)x(s)ds = {j-e lt ~ , Y), where 
the kernel is defined via inverse Fourier transform k(-) = T~ l K{iuj). 

We consider the cases of Ml and Mh simultaneously. For the case of the class Ml, 
we consider 7 > and 7 — > +00. Set D £ = [— Q + e, — e\ for this case. For the case of 
the class Mh, we consider 7 < and 7 — ► —00. Set D £ = (—00, — $7 — e] U [CI + e, +00) 
for this case. 

Let x(-) G .Mz, or x(-) G -M/f. In both cases, Lemma [1] gives that |V(zu;)| < 1 for all 
uj G If 7 — > +00 or 7 — > —00 respectively for Ml or 7W_f/ cases, then V(iu;) — ► 1 
uniformly in u G D e . Hence \\K — K^\ L r D \ — > as 7 — ► +00 or 7 — > — 00, for the cases 
of Ml and Mh, respectively. If x G Ml or x G then 



|</,X)| <max|/(i)|||X|| c(R) . V/GC(R), X = Tx. 



Hence 



\y(t)-y(t)\ 



— e **- y _ y 
2tt ' 



±-j\K-K\X 



< ±-\\K-K\ , ,11 V 



2tt 1 



for all i G R. Then the proof of Theorem [2] follows. □ 
Corollary [1] follows immediately from Theorem [TJ 

Remark 2 Formally, the corresponding predictors require the past values of x(s) for all 
s G (— 00, t], but it is not too restrictive, since k(t — s)x(s)ds can be approximated 
by J_w k(t — s)x(s)ds for large enough M > 0. In addition, the corresponding transfer 
functions can be approximated by rational fraction polynomials, and more general kernels 
k can be approximated by kernels from KL. 

Remark 3 The system for the suggested predictors is stable, since the corresponding 
transfer functions have poles in the domain {Re z < 0} only. However, the suggested 
predictors are not robust. For instance, if the predictor is designed for the class Xl and 
it is applied for a process x(-) £ Xl with small non-zero energy at the frequencies outside 
[— then the error generated by the presence of this energy is increasing if 7 — > 00. 
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